A quantitative assessment of the Hadoop framework for analyzing massively parallel DNA sequencing data

Authors

  • Alexey Siretskiy
  • Tore Sundqvist
  • Mikhail Voznesenskiy
  • Ola Spjuth
Abstract

[This corrects the article DOI: 10.1186/s13742-015-0058-5.]


Similar resources

Local Alignment Tool Based on Hadoop Framework and GPU Architecture

With the rapid growth of next generation sequencing technologies, such as Slex, more and more data have been discovered and published. To analyze such huge data the computational performance is an important issue. Recently, many tools, such as SOAP, have been implemented on Hadoop and GPU parallel computing architectures. BLASTP is an important tool, implemented on GPU architectures, for biolog...


Cloud Computing Technology Algorithms Capabilities in Managing and Processing Big Data in Business Organizations: MapReduce, Hadoop, Parallel Programming

The objective of this study is to verify the importance of the capabilities of cloud computing services in managing and analyzing big data in business organizations because the rapid development in the use of information technology in general and network technology in particular, has led to the trend of many organizations to make their applications available for use via electronic platforms hos...


Next generation sequencing is here now.

The availability of massively parallel DNA sequencers has brought the cost of sequencing genes to affordable levels but the cost of analyzing the huge amount of data has not decreased to the same extent. Thus, only analyzing the sequences of the genes relevant to the patient's condition makes the cost manageable. A panel of genes relevant to lymphedematous conditions is described.


mtDNA-Server: next-generation sequencing data analysis of human mitochondrial DNA in the cloud

Next generation sequencing (NGS) allows investigating mitochondrial DNA (mtDNA) characteristics such as heteroplasmy (i.e. intra-individual sequence variation) to a higher level of detail. While several pipelines for analyzing heteroplasmies exist, issues in usability, accuracy of results and interpreting final data limit their usage. Here we present mtDNA-Server, a scalable web server for the ...


Hadoop-BAM: directly manipulating next generation sequencing data in the cloud

Hadoop-BAM is a novel library for the scalable manipulation of aligned next-generation sequencing data in the Hadoop distributed computing framework. It acts as an integration layer between analysis applications and BAM files that are processed using Hadoop. Hadoop-BAM solves the issues related to BAM data access by presenting a convenient API for implementing map and reduce functions that can ...



Journal title:

Volume 4, Issue

Pages -

Publication date: 2015